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Date procoatfng system with a stays co mput er* 

® ma date processing system hai^ 
nectJng to a host centra] pr oces si n g unit (81) and a host main 
memory (85), the stave computer tnctudes an arithmetic topje 
operattng means (g)» The arttt s netts tagte operating means 
(®) catct&Btes ah address of me host main memory (35). A 
DMA interface 03) drectty mafees an access to thorax mam 
memory (38) to team operand data Into me stave computer. 
The arithmetic teoje oparal&tg means (28? computes me ope- 
rand data under Gerard of a naBroprmjram control section (87) 
and tflnecdy toads the computed resuft to tha host mam memo- 
ry (35) through the DMA K 
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Data 



processing system wi th a slave cam p »^ 

The present invention relates to an electronic 
computer system and more particularly to a computer 
system which is the combination of an ordinary computer 
system and an auxiliary computer system whereby the 
process of which the execution by the ordinary computer 
is made merely at a low speed is carried out by the 
auxiliary computer additionally used and therefore the 
computer system operates at a high speed. 

By convention, the computation as an excessive task 
for the host computer is frequently assigned to an 
additionally used computer system, or a slave processor, 
appropriate for the computation. Such a slave computer 
system is commercially available now. a typical example 
of such is an array processor which execute* the 
computation of array type data at high speed, such as an 
array processor (AP 120B) by Floating Point Systems Inc. 
in USA and ap 400 by Analogic Inc. in USA* The detail 
of the AP 400 is discussed in US Patent No. 4,135,247 
granted to Gordon et al on January 16, 1979. Bach of 
those systems has a high speed memory and an arithmetic 
logic unit, m operation, the computation data is 
fetched from a main memory of a host computer and the 
fetched data is loaded into the high speed memory and 
then is computed by an arithmetic logic unit. The 
computed data is loaded into the high speed memory and 
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afterwards is transferred to the main memory. In this 
way, a sequence of the data processing operation is 
completed. This type apparatus is effective for an 
operation having many intermediate results of 
5 computation such as a First Fourier Transform but is 

improper for a relatively simple processing in which the 
data in the main memory is subjected to a simple 
computation with little production of the intermediate 
computation result. The reason for this is that if the 

10 high speed memory holds the intermediate computation 
results and the results are accessed many times, the 
high speed feature is effectively used; however, with 
■ respect to the processing requiring no intermediate f 
computation results,' it operates merely as a buffer - . 

15 between the main memory and the arithmetic logic units 
Thus, it is ineffective in utilizing the high speed # 

feature thereof. *~ 

When ci » a 4 + b L , i = 1, 1000 is computed by us%ig 
the conventional array processor (ai and b ± are stored* 

20 in the main memory and the memory location for storing? 
Ci is also in the main memory), the data &± and b t are, 
read out from the main memory and are transferred to an 
internal memory. Then, the addition for obtaining C£ is 
performed by an internal arithmetic logic unit. The 

25 result of the addition is stored in the internal memory. 
After all of the results of the additions are computed, 
the total result is returned from the internal memory to 
the main memory. This operation is diagrammatically 
illustrated in the time scale in Fig. 1. 

30 As a first step, the array data a t is read out from 

an address , which is generally needed at the time of the 
set-up and given in some way, and is transferred to the 
memory of the slave processor. This step is performed 
in a section a. The necessary address computation in 

35 this case is performed by the slave processor. In the 
next step, the array data bi is likewise fetched from 
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the main memory. This is performed in a section e. 
When the fetching of all of the array data aj_ and b ± is 
completed, the slave processor reads out the array data 
ai and b L from the internal memory, adds those and loads 
again the result c t into the internal memory by using 
the internal arithmetic logic unit. Normally, the 
internal memory and the arithmetic logic unit of the 
slave computer are higher than the host computer, and 
thus processing in this section is at high speed. This 
operation is performed in a section Y . When the 
computation (the addition in this case) of all of the 
array data are completed, the result of c A is 
transferred (loaded) from the internal memory to the 
main memory of the host computer. This operation is 
15 performed in a section y. Assuming that an access time * 
of the main memory is r am , and a processing time of the 
arithmetic logic unit is x pr , the total of the 
computation time (provided that the main memory could be 
accessed exclusively), is given 
20 lOOOTaro+lOOOx^+lOOOTp^lOOOT^ = 3000x^+1000^ 

Therefore, even if the arithmetic logic unit is 
operable at high speed (x pr +0), the arithmetic time is 
determined by the access time 3000 Tam to the main memory. 
Accordingly, it is undesirable that the internal high 
speed memory is particularly provided in the slave 
computer for effecting such a processing. This is for 
the reason that the high speed performance little 
contributes to the reduction of the processing time and 
that the high speed internal memory is expensive. 

Accordingly, an object of the present invention is 
to provide a computer system with a good performance 
cost. 

Another object of the present invention is to 
provide a computer system with a slave computer having 
35 no internal memory. 

To achieve the above objects, there is provided a 
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data processing system with, a slave computer connecting 
to a host central processing unit and a host main memory 
characterized in that the slave computer comprising: 
arithmetic logic operating means of which the operation 
cycle is higher than the memory cycle of the host main 
memory, and computes the address of the host main 
memory; DMA interface means which is connected to the 
host main memory and the arithmetic logic operating 
means, directly makes an access to the host main memory 
in accordance with the address of the host main memory 
computed by the arithmetic logic operating means, and 
directly store the operation result of the arithmetic 
logic operating means in the host main memory* " 
microprogram control means connected to the arithmetic 
logic operating means and the DMA interface means for * 
controlling those means/ input/output interface means 
connected to the host central processing unit and the 
microprogram control means to apply command inf orraatiojn 
received from the host central processing unit to the J 
microprogram control means; and clock circuit means 
connected to the arithmetic logic operating means, the 
microprogram control means and the DMA interface means^ 
for generating system clocks to operate those means* 

In the present invention, a slave computer system 
with no internal memory, which is expensive, used for 
the array data computation, is connected to a host 
computer. The slave computer computes the data obtained 
by directly making an access to the main memory of the 
host computer system through an interface, and loads the 
result of the computation to the main memory through the 

interface, again* 

In a preferred embodiment of the present invention, 
the slave computer has an arithmetic logiq unit with a 
processing speed several times the memory cycle of the 
main memory, which the arithmetic logic unit is 
controlled by a microprogram control unit* * The 
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microprogram control unit has a number of microprogram 
routines corresponding to the computations. The host 
computer system drives the slave- computer system to 
perform the computation having an extremely long time 
when it is made by the host CPU. Specifically, the host 
CPU gi ves a microprogram routine to be executed and 
information to indicate where operand data is stored in 
the main memory as well, to the slave computer. 
Afterwards it gives a computation start command signal 
to the slave computer, so that the slave computer starts 
the processing related, within one memory cycle, the 
arithmetic logic unit executes a plurality of steps of 
the processing including the address computations of the 
data to be read from or written in the main memory, as 
well as the normal computation. A great amount of data " 
in- the main memory is accordingly processed in a 
Pipeline manner. For the pipeline processing, the slave 
computer must be synchronized with the main memory. To 
this end, the slave computer has an internal clock 
generator and means for inhibiting the oscillation of 
the internal clock for synchronizing with the memory 
cycle of the main memory. 

Other objects and features of the invention will be 
apparent from the following description taken in 
connection with the accompanying drawings, in which: 

Pig. 1 is a time scale useful in explaining the . 
computation of the array data by the conventional 
technique; 

Pig. 2 is a block diagram of an embodiment of an 
electronic computer system according to the invention; 

Pig. 3 is a block diagram of a detailed DMA 
controller shown in Pig. 2; 

Pig. 4 is a memory map of a host main memory shown 
in Pig. 2; 

3 5 

Pigs. 5A and 5B cooperate to form a flow chart of a 
set-up sequence for computing the data array, which is 
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useful in explaining the operation of an embodiment of 
the computer system according to the invention; 

Pig. 6 is a memory map of the microprogram shown in 
Pig. 2; 

Pig. 7 is a block diagram of a detailed branching 
condition circuit shown in Pig. 2; 

Pigs. 8 A and 8B cooperate to form a flow chart of a 
computing sequence of the array data; 

Pig. 9 is a format of a microinstruction used in 

the embodiment; 

Pig. 10 is a timing chart illustrating timings of 
the operation of the embodiment shown in Pig. 2; 

Pig. 11 is a block diagram of a clock circuit shown 
in Pig. 2; and ■ - • . 

Fig. 12 graphically illustrates relations of ^ 
addresses and mnemonic contents. . ■ -* / 

Reference is made to Pig. 2 illustrating a 
construction of a slave computer comprising an I/O ^ 
interface 21, a DMA interface 23, an arithmetic logic ^ 
section 25, a microprogram control unit 27 and a clock ^ 
circuit 29. The slave computer is connected through an 
I/O bus 33 to a host CPU 31 and through a memory bus 37^ 
to a host main memory 35. A host computer system used 
in the present embodiment is a conventional ordinary 

mini-computer. 

The I/O interface 21 provides an interface between 
the slave computer and the host computer through the I/O 
bus 33 and gives a microinstruction (start address) to 
the slave computer and an interruption to the host 
CPU 31. 

The data fed from the host CPU 31 is stored in an 
IDTA register 39. The I/O interface 21 has a flip-flop 
GO 41 and a flip-flop DR 43 for indicating a state of 
the I/O interface 21. 

A DMA interface 23 fetches computation data "from a 
main memory 35 or loads the result of computation by the 
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arithmetic logic section 25 to the main memory 35, which 
the interface 23 has a DMA controller 47, and various 
regxsters such as MAR 49, MDR 51 and MSR 53. The MAR 49 
holds an address to fetch data from the main memory 35 
or an address to store data into the main memory 35. 
The MDR 51 holds the computation result to be stored 
into the main memory 35. The MSR 53 holds the 
computation data fetched from the main memory 35. 

The DMA controller 47 is comprised of an RS 
flip-flop 48 and a D flip— flop 50, as shown in Pig. 3. 
The RS flip-flop 48 holds a memory request signal for 
the host main memory 35. The D flip-flop 50 holds a 
read/write signal for the host main memory 35. The RS 
flip-flop 48 is set by the memory request signal and 
reset by an acknowledge signal derived from the host 
main memory 35. The D flip-flop 50 is set by the 
acknowledge signal. The acknowledge signal, when the 
memory request signal and the read/write signal are 
supplied to the host main memory 35, is an answer back 
signal outputted from the host main memory 35 to the DMA 
controller 47. The memory request signal and the 
read/write signal are produced from a pipeline register 
63 of a microprogram control unit 27 to be described 
later. The DMA controller 47 bypasses the acknowledge 
25 signal to a clock circuit 29 to be described later. 

An arithmetic logic section 25 has an ALU 55 and a 
register file 57 which executes the computation directed 
by a microprogram control section 27. The embodiment 
uses four 4-bit slice bipolar microprocessors (Am 2901) 
manufactured by AMD Co., USA, an ALU of 16 bits and 
16 registers. 

A microprogram control section 27 comprises a micro- 
program sequencer 59, a microprogram memory (ROM) 61, a 
Pipeline register 63 and a branch condition circuit 65. 

A microprogram sequencer 59 such as a microprogram 
sequencer (Am 2910, manufactured by AMD Co. , USA, has an 
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address register 67 therein. The microprogram memory 
( ROM) 61 holds an operation routine performed by the 
arithmetic logic section 25 and a set-up routine to be 
described later. By an address held by the address 
register 67, a specific microinstruction in a specific 
routine is read out from a microprogram memory (ROM) 61 
and is loaded into a pipeline register 63. The 
arithmetic logic section 25, a DMA interface 23 and the 
various registers are controlled in accordance with the 
microinstruction. The branch condition circuit 65 
provides a branch command to the microprogram sequencer 
59 to branch the microprogram in accordance with various 
flip-flop states within the arithmetic logic section 2$, 
and the I/O interface 21. - t 

The clock circuit 29 has a function as a ^ 
synchronizing circuit with respect to the memory access., 
as described later, as well as a function to send a clock 
signal CP to various sections of the slave computer. , 
The operation of the computer system as mentioned^ 
above will be described in detail. As mapped in Pig. 4, 
the main memory 35 stores 1000 data a£ (i - lr 1000) in 
addresses 1000 to 1999, 1000 data bt (i - 1* 1000) in 
addresses 2000 to 2999, and the addition result 
C£ (i =1, 1000) of those corresponding data in 
addresses 3000 to 3999. 

Firstly, the host CPU 31 provides a set-up 
operation. In Figs. 5A and 5B, a flow chart of a set-up 
sequence is illustrated. 

The set-up sequence starts in response to an, 
initialize command delivered through the I/O interface 
21 from the host CPU 31. The initialize command 
initializes the various flip-flops (including those not 
shown) within the slave computer and the microprograms 
as well. In the ROM 61, the set-up routine and various 
arithmetic routines are stored as shown in Fig. 6. In 
response to the initialize command, the set-up routine 
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starts. Then, the host CPO 31 transfers a code 
representing =i - a £ + bi to the IDTA register 39. 

iTIL f IDTA re ' 1Ster 39 3 16 - blt »"«,, «>e cod. 
13 transferred two tines and is set in the registers 
IOTAO 3>1 and IDTAL 39 2 having each an . 8-bit width 

In a block 81, the I/O interface 21 checks if the 
data ,8-bit data of 16 bl t s) dellwea fron ^ 

TJ, T"" ° C " 0t an<3 ' " TOt h°W3 the 

wait loop till the data appears. I, the data is present 
the I/O interface 21 loads the data into the IDTAU 
register 39l (block 83,. Then, in a block 85, the I/O 

8-bit data of the 16 bits, is present or not. if I t ls 
absent, the interface 21 holds a wait loop till the daL 
15 appears, if the data is present ««. ■ / . - 
m.. i present, the interface 21 loads 

the data into the idtal register 39 2 (block 87). The 
code of the idta register 39 thus obtained is a start 
address of the arithmetic routine denoted as #2 for 
compute Ci - a t + b t . when data is set in the IDTA 
ZIT'V 9 ' DR « 13 set and the set-up 

the add *"*' C ° ntentS tte 39 into 

the address register AR 67, when the DR flip-flop 43 
detects the set (block 89). 

CPn ^ 1/0 lntar£are " =»a*s if the host 

iTJi r 3 Wansferred the table address (8 bits of the 
host « « "f T ? addr9SS U '""'erred the 
Trll T ' 1/0 lnterfa< * " «=eives and loads it 
into the idtao register 39 x (block 93). The i/o 

Tblts'T 21 reCetV9S ^ ta " e aadteM ot «>* "fining 
8 bits fro* the host CPO 31 and loads it into the IDTAL 
register 39 2 (blocks 95 and 99). The table address 

tTrtTZ i0 MhiCh ' taMe "presenting the 

" " aadte sses Of the data a t , b t and c t (addresses 

s s;ored 0 i in fK 300< " "* ** nUnbet * 1000, °* «» *" a ' 
st °red in the main memory 35. 

As shown in Pig . 4 , if ^ tM& u ^ ^ 



20 



25 



SOOOD: <EP_pa2l3B5AajL>. 



0021365 



- 10 - 

addresses 500 to 503, the data •500* is loaded as a 
table address into the IDTA register 39. The data in the 
IDTA register 39 is defined by the table handled in the 
microprogram, so that the contents of the table changes 
as the kind of the operation to be executed changes. 

Through the above-mentioned operation, the data 
necessary to start the operation are set up and 
therefore the host CPU 31 sends a GO command to the I/O 
interface 21. The GO flip-flop 41 is set by the GO 
command. A state of the GO flip-flop 41 has been 
supplied to the branch condition circuit 65. As shown 
in a block 99 r when the set-up routine detects this, the 
microprogram is branched to an address held in the 
address register 67 and the execution of the arithmetic 
routine #2 starts (block 101). 

The branching condition circuit 65 will briefly bS 
described referring to Fig. 7. In the figure, the 
branch condition circuit 65 is comprised of a condition 
code selector 69 and an polarity control exclusive 
OR 71. The selector 69, which has been supplied with 
the information of the flip-flops, selects the contents 
of any of the flip-flops. Upon the selected 
information, the exclusive OR 71 supplies a signal to 
direct the branch to the microprogram sequencer 59. In 
this case, if the arithmetic routine #2 is allocated in 
the address 100 of the ROM 61, n 100» set in the pipeline 
register 63 is supplied to the ROM 61 through a selector 
(not shown) and a branch to the address 100 takes place. 
Of course, the address 100 is related to the 
microprogram and not to the main memory. The arithmetic 
routine #2 starting at the address 100 is performed as 
shown in Fig. 8, for example. The central portion of 
such a program, or an addition loop, is of the usual 
type. In a block 103, the contents of the IDTA register 
39 is loaded into the arithmetic logic section 25. The 
first address of a table necessary for starting the 
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operatxon as prevxously stated has been prestored i„ the 
id™ register 39. In this example, an address "500- is 
loaded into the arithmetic logic section 25. The 
arithmetic logic section 25 transfers the address -500" 

35 through the D« a interface 23. Here, -Read- indicate 
a fraction for the slave computer to read (data flows 
main TOmory to slave processor). l„ response to the 
10 T T"""' tte lnt «*>~ » reads the contents of 

T ° f Min ^ 35 < 10 °0 - the start 
address of the array a t , in this example,, and ,. t s Z 

WicT IT 1,8,1 " <blOCk 105) - ■» arithmetic 
logic section 25 Xoads the first address of the arrav 
ai , or the contents of the MSR S3, into the internal 

HI T" Si " il " 1 K' incrementing the address, 

the first address of the b t array, the first address of 
the c t array, and the number of the operations are 
sequentially loaded into the register of the arithmetic 

20 t * ^ M <blOCkS 107 ' 109 — »• in 

20 blocks 115 and 117, a k data ,k - i, i 000) a JT^ 

Vk. ST 1 f " Ched and ' in a «*»* ™- X ->ata 
and bfc data are added and the result of the addition 
Cfc is stored into c*. Then, in a block 121 the c is 
stored into the memory. * 

15 one T"' ' blOC * " 3 8 P ° lnteC K is incremented by 
one. i n a block 125 the pointer K is compared with the 

thTLV! ^ «ne„ the val^f 

the pointer E fails to reach the W, the control is 
returned to the block 115 where similar process and 
0 imputation are performed. The operation loop of the 

AZ llll T a " operation in Le 

flow chart shown i„ Pig . 8 . In the sla „ e 

however, the fetch and the addition of the data are 
overlapped in a pipeline „an„er and all o, the 
operations are competed within the access time of the 

main memory 35 „ - 
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An example of the addition program is illustrated 
in TABLE 1. The operation of the arithmetic logic 
section 25 and the memory request are described in the 
microprogram of the ROM 61. The basic cycle of the 
slave computer is 200 ns, for example, and the cycle 
time of the main memory is 800 ns, for example. 
Therefore , the slave computer can perform one step of 
the program for one memory cycle. 
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j ADDRESS 



ARITHMETIC 
SECTION 



TABLE 

MEMORY 
REQUEST/ 

synchronize! 



REMARKS 



BA+MAR 

BA incr 
nop 

AA+MAR 
MSR+R^ +R2 
AA incr 
nop 

►LOOP BA+mar 
MSR+R X 
BA incr 
R 2 *MDR 

CA+MAR 
MSR+R X +R 2 

CA incr 
k incr 
AA+MAR 

AA incr 
k-N 

BNZ LOOP 

STOP 



REQ(R) 
MSYN 



REQ(R) 
MSYN 



REQ(R) 
MSYN 



REQ(R) 
MSYN 



REQ(W) 
MSYN 



REQ(R) 
MSYN 



COUNT CLEAR, READ REQUEST 
SEND ai ADDRESS 
[UPDATE a ARRAY ADDRESS 

| READ .REQUEST 
SEND b£ ADDRESS 
TRANSFER a x DATA TO Rj 
UPDATE b ARRAY ADDRESS 
READ REQUEST 
SEND a£ ADDRESS 

b l +a l-* R 2( c l> 
UPDATE a ARRAY ADDRESS 
READ REQUEST 
SEND bi ADDRESS 
TRANSFER a £ DATA TO R x 
UPDATE b ARRAY ADDRESS 
SEND c i-x TO MDR 
SEND Cj_£ ADDRESS 
b i +a i**2( c i> 

UPDATE c ARRAY ADDRESS 
COUNT UP 

[SEND ai +1 ADDRESS 

| UPDATE a ARRAY ADDRESS 

L CHECK THE NUMBER 
' OF TIMES 

I END 
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Each microinstruction of the arithmetic routine has 
a memory control section and an arithmetic control 
section, as shown in. Fig. 9. In TABLE 1, the arithmetic 
section indicates the operations described in the 
arithmetic control section and the memory 
request/synchronize indicates the operations described 
in the table. The arithmetic control section has an OP 
code, a source address, and a destination address, which 
are usually used. The description of those will be 
omitted, however. The memory control section has three 
bits REQ, R/W and MSYN, as illustrated enlarged in* 
Pig. 9. The RBQ bit is a bit to direct the request to 
the main memory 35, the R/W bit is a bit to direct if 
the request is the read or the write for the main memory 
35, and the MSYN bit indicates the synchronization with" 
the memory cycle. Of those bits., the contents of the 
REQ bit and R/W bit are supplied to the DMA controller 
47 and the contents of the MSYN bit is supplied to the 
clock circuit 29. The DMA controller 47 produces a 
request signal and an R/W signal to the main memory 35 
in accordance with the contents of the REQ bit and the 
R/W bit. 

In TABLE 1, RBQ(R) indicates that the RBQ bit and 
the R/W bit are set (to Read). Similarly, REQ(W) 
indicates that REQ bit and R/W bit are set (to Write). 

Upon the memory request, the memory cycle of the 
main memory 35 starts. In this example, when the memory 
request with higher priority is received, the memory 
cycle is so programmed as to start delayed by two steps 
from the step which issued the memory request. In other 
words, after approximately 400 ns since the memory 
request is issued, an actual memory cycle starts 
corresponding to the memory request. 

In TABLE 1, symbols k, AA, BA, N, Rj.» *2 and ^ are 
names assigned to specific registers in the register 
file 57. In those names, k designates counters 
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corresponding to the number of operations; AA an address 
of the array data a? BA an address of the array data b; 
CA an address of the array data c; N the number of 
operations (the number of data) previously given by the 
table; R x and R 2 working registers. As the present 
embodiment uses the Am 2901 for the arithmetic section 
25, the MAR 49 and MDR. 51 are connected to the Y output 
port of the AM 2901, and the MSR 53 is connected to the 
D input port. Accordingly, the loading and the fetching 
operations of the contents of these registers are 
possible by the microprogram. 

In the microprogram in TABLE 1, the addresses from 
a to a+12 indicate an initializing step since it is 
impossible to process those addresses from the first 
15 data in the pipeline manner. The steps from the address * 
(a+13) to the address (a+24) indicates a loop in a 
normal state. 

In the cycle of the main memory 35, a* and bj 
(a and b represent "Read") are read such as (a 1# b x ), 
(a 2 r b 2 , c x ), (a 3 , b 3 , c 2 ), (a 4 , b 4 , c 3 ) ..i and 
c i-l obtained in the previous cycle is written. The 
process of four internal cycles is possible within one 
memory cycle. Accordingly, the address control of DMA 
and the count of the loop, and the like are overlapped 
with the memory cycle, as shown in the program, so that 
the computation is pipelined and all of the computations 
are completed within the access time to the memory. 

The microprogram in TABLE 1 will again be described 
with relation to the operation of the host. main memory 
30 35. One memory cycle of the host main memory 35 
corresponds to four steps of the microprogram. 
Accordingly, with MSYN of the address (a+1), the step of 
the address (a+2) is synchronized with the memory cycle. 
Thus, the memory cycle starts after two steps since the 
35 memory request is issued. Therefore, the time taken 

from the address (a+2) to the address (a+5) correspond 
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to one memory cycle and here the array data. a x is* Read. 
Likewise, the addressed (a+b) to (a+9) correspond to one 
memory cycle and here the array data is Read. 
Similarly, the addresses (a+10) to (a+13) are for Read 
5 of a 2 J the addresses (a+14) to (a+17) for Read of b t j 
the addresses (a+18) to (a+21) for Write of c^f the 
addresses (a+22) to (a+24) for Read of a i+1 . 

In the program, When the access request to the host 
main memory 35 is issued, the memory cycle starts 

10 immediately (after approximately 400 ns). The actual 
host main memory 35, however-, is commonly used by the 
CPU 3i, the slave computer and DMA devices such as a 
magnetic disc and a magnetic tape. The devices 
individually request services for the host main memory 

15 35 and exclusively use the memory cycle only when the 
request is accepted. Also during a period that the 
program shown in this example runs, it is impossible for 
the slave computer to continuously use the memory cycle i 
In this case, it must operate using the memory with any^ 

20 other device. To this end, the synchronizing circuit is 
provided in the slave computer so that from an instant 
that the access request to the host main memory 35 is 
issued till an acknowledge signal representing the 
request is accepted is received, the internal clock is 

25 stopped and the slave computer is operated always to 
synchronize with the memory cycle of the main memory 
assigned thereto. The operation timing diagram for 
taking such a synchronisation is illustrated in 
Figs. 10 A to 10 J. An example of the circuit 

30 construction of the clock circuit 29 which also serves 

as a synchronizing circuit is shown in Fig. 11. MCLK in 
Pig. 10B designates a clock signal generated by the 
oscillating circuit 81. The clock signal is 
frequency-divided by the flip-flops 83 and 85 thereby to 

35 form clock signals SCLK1 and SCLK2 • These signals 
SCLK1, SCLK2 are supplied to a disable flip-flop 87 
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through a logic multiplying circuit 88. The machine 
cycle is determined by the clock signal SCLK1. 

The memory synchronizing operation is performed by 
the microprogram as shown in Pig. 12, for example. That 
is at address b, an access request to the memory is 
issued and the machine cycle at the address b+1 should 
be elongated till the memory cycle is assigned to 
synchronize therewith. Accordingly, the memory 
synchronizing operation is specified (M, SYNC command). 
The execution of the program will be described using the 
timing chart. When the program in the address b+1 is 
executed after the program in the address b is executed, 
the M. and SYNC commands cause the MSYN signal to 
synchronize with the SCLK1 thereby to be active. 
The DISA flip-fiop 87 is set by the timing pulse TP 
formed by the SCLK1 and SCLK2, so that the output Q of 
the DISA flip-flop 54 resets the flip-flops 83 and 85.' 
Under this condition, the clock signals SCLKl and SCLK2 
are halted. Then, the main memory returns a signal ACT 
to accept the access to the memory, to the circuit. At 
the leading edge of the ACT signal, the ACT latch 89 is 
set. Since the ACT signal is of asynchronous, the ACT 
latch 89 is made to be synchronized with the MCLK and 
the DISA flip-flop 87 is reset by the output Q of the 
SYNC flip-flop 91. Dpon the reset, the output Q of the 
DISA flip-flop 87, which have reset the flip-flops 83 
and 85 becomes inactive, the SCLKl and SCLK2 become 
active again. At this time point, the execution of the 
program starts to progress and the execution of the 
program in the address b+2 is started. By performing 
such synchronizing operation, the internal operation may 
be performed with respect to the common use of the 
memory, correctly and smoothly. 

According to the invention, the slave computer has 
no internal memory to hold a large amount of data. 
Accordingly, a large amount of data may be processed at 
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a high speed by using an inexpensive slave computer, so 
that the computer system of the invention has a high 
perfonaance/cost ratio. In the above-mentioned 
embodiment, a space to occupy the slave computer on a 
single print substrate of 12 x 12 inches is almost same 
compared with the conventional one. The additional use 
of the slave computer never interfere the host computer 
system but takes charge of processing the picture data, 
and the array data, which are not well processed by the 
host computer thereby to improve the data processing 
speed o€ the whole system. The operation for the host 
computer to start the slave computer is simplified and^ 
this indicates that the computer system is easy to use_ 
from the host computer side. % 

A preferable application of the invention is a 
computed tomography (CT) which irradiates the X ray beam 
onto the exterior of a patient to collect the data 
transmitted therethrough and to obtain a tomogram of ae 
diseased part of the patient by processing the ^ 
transmitted data by means of a computer. To obtain the 
tomogram, a calculation called a back projection is 
performed. The amount of the data used in the back 
projection ranges from several tens thousand to several 
hundreds thousand. Such an enormous amount of data may 
be processed for a short time by the computer system of 
the present invention. The CT apparatus includes a 
computer system generally and has an extremely 
expensive. The use of the computer system of the 
invention reduces the cost of the CT apparatus. * 
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Claims: 



1. In a data processing system with a slave 
computer connecting to a host central processing unit 
and a host main memory, the improvement wherein said 
slave computer comprising: 

arithmetic logic operating means of which the 
operation cycle time is shorter than the memory cycle 
time of said host main memory, and computes the address 
of the host main memory; 

DMA interface means which is connected to said host 
main memory and said arithmetic logic operating means, 
directly makes an access to said host main memory by the 
address computed by said arithmetic logic operating 
means, and directly loads the result of said arithmetic 
logic operating means to said host main memory; 

microprogram control means connected to said 
arithmetic logic operating means and said DMA interface 
means for controlling those means; 

input/output interface means connected to said host 
central processing unit and said microprogram control 
means to apply command information received from said 
host central processing unit to said microprogram 
control means; and 

clock circuit means connected to said arithmetic 
logic operating means, said microprogram control means 
and said DMA interface means for generating system 
Clocks to operate those means. 

2. A data processing system according to claim 1, 
wherein the address computation for the host main memory 
and the computation of operand data taken out from said 
host main memory are performed together in said 
arithmetic logic operating means. 

3. A data processing system according to claim 1, 
wherein said clock circuit means includes means for 
inhibiting the generation of system clocks in order to 
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produce the system clock in synchronism with the memory 
clock cycle of said host main memory, 

4. A data processing system according to claim 1, 
wherein said microprogram control means includes a 
set-up routine and a microprogram memory for storing the 
arithmetic logic routine, whereby executing the set-up 
routine, a command information supplied from said host 
central processing unit is fetched to branch the program 
to an operation routine designated. 

5. A data processing system according to claim 1, 
wherein said DMA interface means comprises s 

a memory* address register for holding an address to 
fetch the data from said host main memory or an address 
to store data into said host main memory ; 

a memory data register for holding the result of 
computation into said host main memory; 

a memory storage register for holding the data 
fetched from said host main memory; and 

a DMA controller for holding a memory request 
signal and a read/write signal for said host main 
memory, and for receiving an acknowledge signal from 
said host main memory. 

6. A data processing system according to claim 1, 
wherein said microprogram control means comprises s 

a read only memory for storing a microprogram? 

a microprogram sequencer for supplying an address 
in which the microprogram is stored to said read only 
memory; 

a pipeline register for holding each instruction of 
the microprogram read out from said read only memory; 

a branch condition circuit for supplying control 
signals to said microprogram sequencer on the basis of 
the conditions produced from said arithmetic control 
means* 

7. A data processing system according to claim 6, 
wherein said branch condition circuit comprises? 
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a condition code selector which receives condition 
signals outputted from said input/output interface means 
and flag signals outputted from said arithmetic logic 
operating means, and selectively outputs those on the 
basis of control signals supplied from said microprogram 
control means; and 

a logic circuit connecting to the output of said 
condition code selector which receives a condition 
signal or a flag signal outputted from said condition 
code selector and control signal from said microprogram 
control means thereby to supply a branch specifying 
signal to said microprogram control means. 

8. a data processing system according to claim 1, 
wherein said clock circuit means comprises: 

an oscillator for oscillating basic clock signals/ 
first and second frequency-dividing flip-flop 
circuits which receive the basic clock signals from said 
oscillator and produce first and second 
frequency-divided clock signals; 

a logic multiplying circuit for receiving 
frequency-divided clock signals from said first and 
second flip-flops to produce a given timing pulse 
s ignal ; 

a disable flip-flop which receives an output signal 
from said logic multiplying circuit to reset said first 
and second frequency-divided flip-flops thereby to halt 
said frequency divided clock signals; 

a latch circuit which receives a memory access 
acknowledge signal from said host main memory to produce 
a logical high level signal; and 

a synchronous flip-flop circuit which receives an 
output signal from said latch circuit to supply a 
synchronous signal to said disable flip-flop and to 
reset said disable flip-flop thereby to make active said 
first and second frequency-divided flip-flops. 
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Data processing system with a slave computer. 

@ In a data processing system having a slave computer 
connecting to a host cemral processing unit ©1) end a host 
main memory (S5)» the slave computer includes an arithme- 
tic logic operating means (25). The eHthmeiie logic operating 
means (25) calculates an address of the host main memory 
(35)» A DMA interface (23) directly makes an access to the 
host main memory (3S) to fetch operand data Into the slave 
computer. The a rithm et ic logic operating means (25) com- 
putes the operand data under control of a microprogram 
control section (27) and dfrectty toads the computed result to 
the host main memory (39) through the DMA Interface (23). 
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